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METHODS FOR ALTERING THE NUTRITIONAL VALUE OF A PLANT PROTEJN", VEGETATIVE STORAGE PROTEIN, 
VSP, BY ALTERING THE AMINO ACID CONTENT OF PROTEINS 

FIELD OF THE INVENTION 

The invention relates to a process for the production of proteins having high 
nutritional properties. The methods find particular use in the production of plants 
with increased levels of amino acids having high nutritional properties through the 
5 modification of plant genes. 

BACKGROUND OF THE INVENTION 

Autotrophic organisms can make all of their own amino acids. Other cells 
utilize many preformed amino acids. Humans and other higher animals require a 
10 number of essential amino acids in the diet. These essential amino acids are 
obtained directly or indirectly by eating plants. These essential amino acids 
include lysine, tryptophan, threonine, methionine, phenylalanine, leucine, valine 
and isoleucine. 

Constructing proteins with higher nutritional value has been a long-sought 
15 goal of scientists. Traditionally, agricultural scientists concentrated on breeding 
plants with high nutritional yield. Typically, these new varieties were richer in 
carbohydrates but usually poorer in essential proteins than the wild type varieties 
from which they were derived. 

Seed storage proteins represent up to 90% of total seed protein in seeds of 
20 many plants. They are used as a source of nutrition for young seedlings in the 

period immediately following germination. The genes encoding them are strictly 
regulated, being expressed in a highly tissue specific and stage specific manner. 
These genes are almost exclusively expressed in developing seed. Different 
classes of seed storage proteins may be expressed at different stages in the 
25 development of the seed. They are typically stored in membrane bound organelles 
called protein bodies or protein storage vacuoles. 
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A related group of proteins, the vegetative storage proteins, have similar 
amino acid compositions and are also stored in specialized vacuoles. These 
proteins are generally found in leaves instead of seeds. These proteins are 
degraded upon flowering, and are thought to serve as a nutritive source for 
5 developing seeds. 

Cereal grains and legume seeds which are key protein sources for the 
vegetarian diet are generally deficient in essential amino acids such as methionine, 
lysine, and threonine. Therefore, there is needed means for improving the 
nutritional quality of these proteins. 

10 

SUMMARY OF THE INVENTION 

Compositions and methods for altering the amino acid profiles of proteins 
without introducing conformational changes into the protein are provided. The 
method involves preparing a binding partner and/or an interacting molecule which 
1 5 binds to the native protein and using such interacting molecule to select for 
modified proteins retaining the native conformation. 

The method finds particular use in altering the nutritional value of proteins. 
A plant protein having increased methionine levels is provided. The modified 
protein retains the conformation of the native protein while having significantly 
20 higher levels of methionine. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows VSP homologies. 

VSP-b (same as VSPP) and VSP-a (same as VSPa): Staswick, P.E., 
25 (1 988), Plant Physiol 87, 250-254. 

T.phos (tomato acid phosphatase): Erion, J.L., Ballo, B., May, L., Bussell, 
J., Fox, T.W., & Thomas, S.R., SwissProt database accession number P27061. 

Ph.vulg (Phaseolus vulgaris): Zhon, P-Y., Tanaka, T M Yamauchi, D., & 
Minamikawa, T. (1997), Plant Physiol 113, 479-485. 
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Ar.VSP (Arabidopsis thaliana): Yu, D.Y., Quigley, F., & Mache, R., 
EMBL database accession number X79490. 

Ar.lA-1, Arl7A-l {Arabidopsis thaliana, floral organs): Utsugi, S., 
Sakamoto, Ogura, Y., Murata, M., & Motoyoshi, F. (1996) Plant Mol Biol 32, 
5 759-765. 

Fig. 2 shows proposed VSPp methionine-enriched variants. 

Fig. 3 A shows the hydropathy index computation for sequence VSPp. 

Fig. 3B shows the hydropathy index computation for sequence VSPMetlO. 

Fig. 3C shows the hydropathy index computation for sequence VSPMet20. 
10 Fig. 3D shows the hydropathy index computation for sequence VSPMet30. 

Fig. 4 shows the VSPp-metlO sequence. 

Fig. 5 shows the colony lift assay to detect protein-protein interactions. 



DETAILED DESCRIPTION OF THE INVENTION 

15 Proteins having altered amino acid profiles are provided The proteins can 

be designed to be enriched in essential amino acids, including lysine, methionine, 
tryptophan, threonine, phenylalanine, leucine, valine and isoleucine relative to 
average levels of such amino acids in the native protein. Generally, knowledge 
of the three-dimensional (3-D) structure of a given protein allows one to engineer 

20 amino acid substitutions in a rational manner so as to effect a desired change in the 
property of the protein without compromising the folding process. The present 
invention provides methods for increasing the levels of essential amino acids 
within a protein while at the same time the altered protein has the conformation of 
the native protein. 

25 The present invention provides methods for altering the amino acid content 

of a protein whose 3-D structure is unknown or unavailable. The method may also 
provide an easy method for assessing changes in a protein in which the structure of 
the protein is known but tools for confirming conformation of the protein may be 
unavailable. The "conformation" of a protein refers to the spatial arrangement of 

30 substituent groups of the molecule. The polypeptide chain of a protein has only 

-3- 
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one conformation (or a very few) under normal biological conditions of 
temperature and pH. This, referred to as the "native conformation," confers 
biological activity. The native conformation is sufficiently stable so that the 
protein can be isolated and retained in its native state. Therefore, it is important to 
5 be able to change the amino acid content of a protein, yet at the same time have the 
protein retain its biological activity. 

The methods of the invention are useful for making amino acid changes 
within proteins whose conformation is unknown or unavailable. Such proteins 
include the vegetative storage protein which is believed to play a significant role in 

1 0 supplying amino acids for protein deposition during seed fill, and other proteins of 
the seed. The methods of the invention may be used to modify the amino acid 
composition of any protein. Examples of such proteins include but are not limited 
to wheat endosperm purothionine (Mak and Jones (1976) Con, J. Biochem. 
22:83 J); albumins (Higgins et al. (1986) J. Biol Chem. 261 :1 1 124); and 

15 methionine rich proteins (Pedersen et al. (1986)7. Biol Chem. 261:6279; Kirihara 
etal. (1988) Gene 71:359; Musumura et al. (1989) ite. Mol Biol 12:123). 

The methods of the invention comprise altering the amino acid composition 
of a protein to produce an engineered protein. The engineered protein will retain 
the conformation and activity of the native protein yet have a modified or altered 

20 amino acid content. In this manner, levels of particular amino acids of interest can 

be increased or decreased. Of particular interest, is to increase the levels or 
numbers of essential amino acids in the proteins. By essential amino acid is 
intended, lysine, tryptophan, threonine, methionine, phenylalanine, leucine, valine, 
isoleucine, and cysteine. However, it is recognized that the amino acid 

25 composition can be changed in various ways, as long as the changes do not affect 

the conformation of the final protein. 

The proteins of the invention have been engineered or modified to contain 
altered amino acid levels. The engineered protein retains the conformation of the 
native protein. The method involves preparing binding partners and/or interacting 

30 molecules to the native protein and utilizing these interacting molecules to 
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determine whether the engineered protein folds correctly. By "binding partner" or 
"interacting molecule" is intended a molecule which is capable of binding or 
interacting with the proteins of interest. Such binding partners or interacting 
molecules include antibodies, monoclonal antibodies, antibody fragments, proteins, 
5 modified proteins, nucleotide sequences, such as aptomers, chemical compounds 
(e.g. carbohydrates, etc.), or combinations thereof. The interacting molecules also 
encompass polypeptides that have an intrinsic affinity to the protein of interest, 
particularly such polypeptides that are capable of binding with the protein of 
interest to form an oligomeric complex. For example, VSP-alpha binds VSP with 
1 0 high affinity and could be used as an interacting molecule for the altered VSP 
protein. 

Methods for antibody production are known in the art. See, for example, 
Antibodies, A Laboratory Manual, Harlow and Lane (Eds.), Cold spring Harbor 
Laboratory Press, Coldspring Harbor, NY (1988), and the references cited therein. 

15 See also, Radka et al (1983) J. Immunol 128:2804; and Radka et al (1984) 
Immunogenetics 19:63. All of which are herein incorporated by reference. 

Once antibodies, preferably monoclonal antibodies, are available which 
bind to the native protein, such antibodies can be used to select for modified 
proteins which retain the conformation of the native protein. Strategies to identify 

20 residues within a protein that might tolerate amino acid substitution include 

mutational analysis, secondary structure prediction, homology comparison, and the 
like. Such strategies can be used to identify amino acids within the protein that 
will tolerate amino acid substitution. 

By mutational analysis is intended mutagenic PCR and DNA shuffling. 

25 See, for example, Stemmer, W.P. (1994) Nature 370:389-391 ; and Stemmer, W.P. 
(1994) Proc. Natl Acad. Sci. USA 91:10747-10751, herein incorporated by 
reference. Such methods can be used to generate phage display libraries of protein 
genes containing random mutations. Phage display is an in vitro selection 
technology which allows for a foreign protein or peptide to be displayed on the 

30 surface of filamentous phage, linking the phenotype of the phage to its genotype. 
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Molecular repertoires with sufficient diversity can be generated using such 
technology. Proteins which exhibit the correct conformation, that is, the native 
conformation, can be selected for by the ability to bind antibodies recognizing 
conformational domains of the native protein. See also Methods in Enzymology, 
Combinatorial Chemistry, John N. Abelson (Ed.), Vol. 267, Academic Press, Inc., 
San Diego, CA, herein incorporated by reference. Once correctly-folded protein 
variants are determined, subsequent isolation and sequencing of the variants 
reveals the tolerated sites for mutations. Alternatively, correctly folded variants 
may be identified by other screening or selection methods such as filter lift-assay 
and ELISA. 

Substitutions may also be incorporated at secondary structure prediction 
sites. Structural features of the protein are important for proper folding. Sequence 
analysis tools such as the GCG (Wisconsin sequence analysis package, Genetics 
Computer Group, University Research Park, 575 Science Drive, Madison, WI) and 
PC/GENE (Oxford Molecular Group, 2105 S. Bascom Avenue, Suite 200, 
Campbell, CA) can be used to analyze protein sequence for secondary structure 
features such as helices, sheets and turns. In this manner, it can be determined 
whether a particular stretch of amino acids may reside on the surface of the protein. 
Residues on the surface of a protein tolerate substitution more readily than buried 
residues without compromising the structure of the protein. Utilizing these 
algorithms, predicted turns and surface regions of the proteins can be made. 
Therefore, predictions can be made into which regions amino acid substitutions can 
be made without affecting conformation. 

Sites for amino acid substitution can also be determined by homology 
comparison to other proteins. Nature has tested the tolerance of protein residues to 
substitution as exemplified in the sequences of proteins such as globins and 
cytochromes from several different species, members of which have the same fold. 
See, for example, Hampsey et al (1988) FEBS Lett 231:275; Bashford et al 
(mi)J.MoLBioL 196:199; Lesk and Chothia (1980) J. M>/. Biol. 136:225. 
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In designing proteins of the invention, hydrophobic residues, such as 
alanine, cysteine, valine, isoleucine, leucine, methionine, phenylalanine, and 
tryptophan may be substituted for one another without undue perturbation of the 
structure. Such residues generally occur in the hydrophobic core of the protein. 
5 See, Bowie etal (1990) Science 247:1306-1310; and Baldwin and Matthews 

(1994) Curr. Opin. Biotech 5:396-402. See also Ladunga and Smith (1997) ProL 
Eng. 10:187-196, herein incorporated by reference. Generally, residues that 
substitute for one another in related sequences do so by conserving the physico- 
chemical properties of the residue and folding of the protein thus conserving the 3- 
10 D structure of the protein. 

Therefore, the protein to be modified can be compared with homologous 
proteins. Amino acids that are critical to the function and/or folding of the protein 
would be expected to be conserved over time. Therefore, predictions can be made 
as to which amino acids can be substituted without affecting the conformation or 
15 folding of the protein. 

Such selected amino acid substitutions can be made by DNA sequencing, 
site-directed mutagenesis, or other methods which substitute one amino acid with 
any other amino acid. 

Once the amino acid substitutions have been made and the conformation 
20 confirmed by antibody binding, the protein can be expressed using known 

expression systems. Where necessary, the DNA encoding the protein can be 
synthesized using known techniques. Likewise, the nucleotide sequence encoding 
the protein can be contained within expression cassettes. 

Utilizing the methods of the invention, proteins can be constructed which 
25 have increased nutritional quality. That is, the essential amino acid content within 

the protein can be increased to represent at least about 5 -about 10%, preferably at 
least about 10-about 20%, more preferably at least about 20-about 40% of the total 
amino acid content in the protein. 

In the same manner, the amino acid content of a subject protein can be 
30 altered to include at least about 10% amino acid substitutions, additions or 
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deletions, about 20% or even up to about 30% to about 40%. It is recognized that 
the limitation will be the activity of the altered. The present invention provides a 
convenient and ready mechanism to test the activity of the protein by its ability to 
bind the interacting molecule. 

For convenience for expression in plants, the nucleic acid encoding the 
modified peptides or proteins of the invention can be contained within expression 
cassettes. The expression cassette will comprise a transcriptional initiation 
region linked to the nucleic acid encoding the peptide of interest. Such an 
expression cassette is provided with a plurality of restriction sites for insertion of 
the gene or genes of interest to be under the transcriptional regulation of the 
regulatory regions. 

The transcriptional initiation region, the promoter, may be native or 
homologous or foreign or heterologous to the host, or could be the natural 
sequence or a synthetic sequence. By foreign is intended that the transcriptional 
initiation region is not found in the wild-type host into which the transcriptional 
initiation region is introduced. 

The transcriptional cassette will include the in 5N-3N direction of 
transcription, a transcriptional and translational initiation region, a DNA 
sequence of interest, and a transcriptional and translational termination region 
functional in plants. The termination region may be native with the 
transcriptional initiation region, may be native with the DNA sequence of 
interest, or may be derived from another source. Convenient termination regions 
are available from the Ti-plasmid of A. tumefaciens, such as the octopine 
synthase and nopaline synthase termination regions. See also, Guerineau et al. , 
(1991) Mol Gen. Genet. 262:141-144; Proudfoot (1991) Cell 64:671-674; 
Sanfacon et al. (1991) Genes Dev. 5:141-149; Mogen et al. (1990) Plant Cell 
2:1261-1272; Munroe et al. (1990) Gene 92:151-158; Ballas et al. 1989) Nucleic 
Acids Res. 77:7891-7903; Joshi etal. (1987) Nucleic Acid Res. 75:9627-9639. 

Where appropriate, the gene(s) expressing the modified proteins may be 
optimized for increased expression in the transformed plant. In this manner, the 
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sequences can be synthesized using monocot, dicot or particular plant; i.e. maize, 
soybean, sorghum, wheat, etc., preferred codons for improved expression. 
Methods are available in the art for synthesizing plant preferred genes. See, for 
example, U.S. Patent Nos. 5,380,831, 5,436, 391, and Murray et al. (1989) 
5 Nucleic Acids Res. 77:477-498, herein incorporated by reference. 

The expression cassettes may additionally contain 5' leader sequences in 
the expression cassette construct. Such leader sequences can act to enhance 
translation. Translation leaders are known in the art and include: picorna virus 
leaders, for example, EMCV leader (Encephalomyocarditis 5' noncoding region) 
10 (Elroy-Stein, O., Fuerst, T.R., and Moss, B. (1989) PNAS USA, 56:6126-6130); 
poty virus leaders, for example, TEV leader (Tobacco Etch Virus) (Allison et al. 

(1986) ; MDMV leader (Maize Dwarf Mosaic Virus); Virology, 154:9-20), and 
human immunoglobulin heavy-chain binding protein (BiP), (Macejak, D.G., and 
P. Sarnow (1991) Nature, 355:90-94; untranslated leader from the coat protein 

15 mRNA of alfalfa mosaic virus (AMV RNA 4), (Jobling, S.A., and Gehrke, L., 

(1987) Nature, 525:622-625; tobacco mosaic virus leader (TMV), (Gallie, D.R. 
et al (1989) Molecular Biology of RNA, pages 237-256; and maize chlorotic 
mottle virus leader (MCMV) (Lommel, S.A. et al (1991) Virology, 
87:382-385). See also, Della-Cioppa et al. (1987) Plant Physiology, 54:965-968. 

20 Other methods known to enhance translation can also be utilized, for example, 

introns, and the like. 

The expression cassettes may contain one or more than one nucleic acid 
sequences to be transferred and expressed in the transformed plant. Thus, each 
nucleic acid sequence will be operably linked to 5' and 3' regulatory sequences. 

25 Alternatively, multiple expression cassettes may be provided. 

Generally, the expression cassette will comprise a selectable marker gene 
for the selection of transformed cells. Selectable marker genes are utilized for 
the selection of transformed cells or tissues. Such selectable marker genes are 
known in the art. See generally, G. T. Yarranton (1992) Curr. Opin. Biotech., 

30 5:506-511; Christopherson et al. (1992) Proc. Natl. Acad. Sci. USA, 59:6314- 
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6318; Yao et al. (1992) Cell 77:63-72; W. S. Reznikoff (1992) MoL Microbiol., 
6:2419-2422; Barkley etaL (1980) The Operon, pp. 177-220; Hu etaL (1987) 
Cell, 45:555-566; Brown etaL (1987) Cell, 49:603-612; Figge era/. (1988) 
Cell, 52:713-722; Deuschle a/. (1989) Proc. Afa//. /lazd. AcL USA, 56:5400- 
5 5404; Fuerst et aL (1989) Proc. NatL Acad, ScL USA, 56:2549-2553; Deuschle 
et aL (1990) Science, 245:480-483; M. Gossen (1993) Ph.D. Thesis, University 
of Heidelberg; Reines etaL (1993) Proc. NatL Acad. ScL USA, 90:1917-1921; 
Labow etaL (1990) MoL Cell Bio., 70:3343-3356; Zambretti et al. (1992) Proc. 
NatL Acad. ScL USA, 59:3952-3956; Bairn etaL (1991) Proc. NatL Acad. ScL 

10 USA, 55:5072-5076; Wyborski et aL (1991) Nuc. Acids Res., 79:4647-4653; A. 
Hillenand-Wissman (1989) Topics in MoL andStruc. BioL, 70:143-162; 
Degenkolb et al. (1991) Antimicrob. Agents Chemother., 55:1591-1595; 
Kleinschnidt et aL (1988) Biochemistry, 27:1094-1104; Gatz et al. (1992) Plant 
J., 2:397-404; A. L. Bonin (1993) PhD Thesis, University of Heidelberg; 

15 Gossen et al. (1992) Proc. Natl. Acad. ScL USA, 59:5547-5551; Oliva etal. 
(1992) Antimicrob. Agents Chemother. , 56:913-919; Hlavkae/a/. (1985) 
Handbook of Exp. Pharmacology, 78; Gill et al. (1988) Nature 334:721-724; 
DeBlock et al. (1987) EMBO J., 6:2513-2518; DeBlock et al. (1989) Plant 
Physiol., 97:691-704; Fromm et al. (1990) 8:833-839; Gordon-Kamm et al. 

20 (1990) 2:603-618. Such disclosures are herein incorporated by reference. 

The nucleotide sequences of interest of this invention can be introduced 
into the genome of the desired host organism in a variety of techniques known in 
the art. For the purposes of this invention, it will be appreciated to those skilled 
in the art that any conventional transformation vector may be used as long as it is 

25 capable of transforming the organism of choice and it does not have restriction 
sites in common with those comprising the final master insertion cassette. 
Hence, the detailed experimental description of transformation vectors is given 
by way of illustration only. 

Vector systems are known for the transformation of yeast and bacterial 

30 cells. For yeast, these include but are not limited to autonomously replicating 
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plasmids (see, for example, Stearns etal (1990) Methods Enzymol. 185:280- 
297); 2-micron circle yeast DNA sequences (see, for example, Hollenberg (1982) 
Curr. Topics Microbiol, Immunol 96:119-144; Broach (1983) Methods Enzymol. 
101:307-325; MacKay (1983) Methods Enzymol. 101:325-343; Armstrong (1989) 
5 BioTechnology 13:165-192; Rose (1990) Methods Enzymol 185:234-279); 

linearized vector DNA (see, for example, see, for example, Takita et al (1997) 
Yeast 13:763-768); artificial chromosome vectors (Burke (1987) Science 
236:806-812); restriction site bank plasmids (Davison (1987), U.S. Patent No. 
4,657,858, and Methods Enzymol 153: 34-54); delta-integration vectors (see, for 
10 example, Lee and Da Silva (1997) Biotechnol Prog. 13:368-373); and 

Agrobacterium-b&sed vectors (see, for example, Bundock et al (1995) EMBOJ. 
14:3206-3214; Piers etal (1996) Proc. Natl Acad. Scl USA 93:1613-1618; 
Risseeuw et al (1996) Mol Cell Biol 16:5924-5932); and Shuttle Vectors (see, 
for example, Schneider (1991) Methods Enzymol 194:373-388; Singh (1997) 
15 Methods Mol Biol 62:113-130). See generally Hinnen (1980) Curr. Topics 
Microbiol Immunol 96:101-117; Nombela (1985) Revis. Biol Cel 4:1-25; 
Parent (1985) Yeast 1(2):83-138; West (1988) BioTechnology 10:387-404; 
Schena (1991) Methods Enzymol 194:389-398; Schneider (1991) Methods 
Enzymol 194:373-388; and Singh (1997) Methods. Mol Biol 62:113-130. 
20 Vector systems used for bacterial transformation include, but are not 

limited to, yeast shuttle vectors (see, for example, Ward (1990) Nucleic Acids 
Res. 18(17):5319; Strathern (1991) Methods Enzymol 194:319-329; Soni (1992) 
Nucleic Acids Res. 20(21) 5852; Nacken (1994) Nucleic Acids Re. 22:1509-1510; 
Wehmeier (1995) Gene 165:149-150); pBR322 and related plasmids such as 
25 pBR327 and pKC7 (see, for example, Rao and Rogers (1979) Gene 7:79-82; 

Talmadge and Gilbert (1980) Gene 12:235-241; Smith et al. (1995) Microbiology 
141(pt. 1): 181-188); pATH vectors (see, for example, Koerner et al (1991) 
Methods in Enzymol 194:477-490); yeast plasmids (see, for example, Marcil 
(1992) Nucleic Acids Res. 20:917); and natural replicon ColEI and related 
30 plasmids such as P15A, F, RSF1010, and R616 (see, for example, Muhlenhoff 
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and Chauvat (1996) Mol Gen. Genet. 252:93-100; Sakai and Komano (1996) 
BioscL BiotechnoL Biochem. 60:377-382; Lee and Henk (1997) Vet. Microbiol 
54:369-374); herein incorporated by reference. 

A number of vector systems are also known for the introduction of 
5 foreign or native genes into mammalian cells. These include SV40 virus (see, 
for example, Okayama et al (1985) Molec. Cell BioL 5:1136-1142); Bovine 
papilloma virus (see, for example, DiMaio et al. (1982) Proc. Natl Acad. Sci. 
USA 79:4030-4034); adenovirus (see, for example, Morin et al (1987) Proc. 
Natl Acad. Sci. USA 84:4626; Yifan et al (1995) Proc. Natl Acad. Sci. USA 
10 92:1401-1405; Yang etal (1996) Gene Ther. 3:137-144; Tripathy et al. (1996) 
Nat. Med. 2:545-550; Quantin et al (1992) Proc. Natl. Acad. Sci USA 89:2581- 
2584; Rosenfeld etal. (1991) Science 252:431-434; Wagner (1992) Proc. Natl. 
Acad. Sci. USA 89:6099-6103; Curiel etal (1992) Human Gene Therapy 3:147- 
154; Curiel (1991) Proc. Natl Acad. Sci USA 88:8850-8854; LeGal LaSalle et 
15 al (1993) Science 259:590-599); Kass-Eisler et al (1993) Proc. Natl Acad. Sci 
USA 90:11498-11502); adeno-associated virus (see, for example, Muzyczka et 
al (1994)7. Clin. Invest. 94:1351; Xiao etal. (1996)7. Virol 70:8098-8108); 
herpes simplex virus (see, for example, Geller et al (1988) Science 241:1667; 
Huard etal (1995) Gene Therapy 2:385-392; U.S. Patent No. 5,501,979); 
retrovirus-based vectors (see, for example, Curran et al (1982) J. Virol, 
44:674-682; Gazit etal. (1986)7. Virol, 60:19-28; Miller (1992) Curr. Top. 
Microbiol Immunol 158:1-24; Cavanaugh etal. (1994) Proc. Natl Acad. Sci. 
USA 91:7071-7075; Smith etal. (1990) Mol. Cell Biol 10:3268-3271); herein 
incorporated by reference. 

Methods of the present invention can be used to facilitate assembly of 
nucleotide sequences of interest for transformation of any plant. In this manner, 
genetically modified plants, plant cells, plant tissue, seed, and the like can be 
obtained. The transformation vector and hence method of transformation chosen 
will depend on the type of plant or plant cell, i.e. monocot or dicot, targeted for 
transformation. Suitable methods of transforming plant cells include 

-12- 



SUBSTITUTE SHEET (RULE 26) 



WO 99/29882 PCT/US98/26209 

microinjection (Crossway et al (1986) Biotechniques 4:320-334); electroporation 
(Riggs etal. (1986) Proc. Natl Acad. Sci USA 83:5602-5606); Agrobacterium- 
mediated transformation (Hinchee et al (1988) Biotechnology 6:915-921); direct 
gene transfer (Paszkowski et al (1984) EMBO J. 3:2717-2722); and ballistic 
5 particle acceleration (see, for example, Sanford et al U.S. Patent 4,945,050; 
WO91/10725 and McCabe et al (1988) Biotechnology 6:923-926). Also see, 
Weissinger et al (1988) Annual Rev. Genet. 22:421-477; Sanford etal. (1987) 
Particulate Science and Technology 5:27-37 (onion); Christou et al (1988) Plant 
Physiol. 87:671-674 (soybean); McCabe etal. (1988) BioTechnology 6:923-926 
10 (soybean); Datta et al. (1990) Biotechnology 8:736-740 (rice); Klein et al. (1988) 
Proc. Natl. Acad. Sci. USA 85:4305-4309 (maize); Klein etal (1988) 
BioTechnology 6:559-563 (maize); WO91/10725 (maize); Klein etal (1988) 
Plant Physiol 91:440-444 (maize); Fromm et al (1990) BioTechnology 
8:833-839; and Gordon-Kamm et al (1990) Plant Cell 2:603-618 (maize); 

1 5 Hooydaas-Van Slogteren and Hooykaas (1984) Nature (London) 3 1 1 :763-764; 

Bytebier et al (1987) Proc. Natl Acad. Sci USA 84:5345-5349 (Liliaceae); De 
Wet et al. (1985) In The Experimental Manipulation of Ovule Tissues, ed. G.P. 
Chapman etal, pp. 197-209 (Longman, N.Y.) (pollen); Kaeppler et al. (1990) 
Plant Cell Reports 9:415-418; and Kaeppler et al. (1992) Theor. Appl Genet. 

20 84:560-566 (whisker-mediated transformation); D 1 Halluin et al (1992) Plant 

Cell 4:1495-1505 (electroporation); Li etal (1993) Plant Cell Reports 12:250-255, 
and Christou and Ford (1995) Annals of Botany 75:407-413 (rice); Osjoda et al. 
(1996) BioTechnology 14:745-750 (maize \izAgrobacterium tumefaciens); all of 
which are herein incorporated by reference. 

25 The following examples are offered by way of illustration and not by way 

of limitation. 

EXPERIMENTAL 
Three complementary strategies, namely, mutational analysis, secondary 
30 structure prediction, and homology comparison (see below) have been used to 
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identify amino acids within VSPp (vegetative storage protein) that might tolerate 
methionine substitution. Together, results from these strategies facilitated the 
design of three VSP variants with increasing methionine content. 



1 . Mutational analysis 

The simple premise behind this strategy was that if one prepared 
monoclonal antibodies that recognized the wild-type VSP, then these same 
antibodies would, if the mutant proteins folded correctly, also recognize the 
engineered proteins. As a first step, therefore, mice were injected with VSP 
purified from soybean leaves, and a panel of 21 monoclonal antibodies recognizing 
wild-type VSP has been characterized by ELISA. These antibodies also recognize 
VSPct expressed and purified from Pichia pastoris. 

The following two approaches can be implemented to generate either 
random or "semi-rational" mutations in VSPp. Mutagenic PCR and DNA 
shuffling (Stemmer, W.P. (1994) Nature 370, 389-391; Stemmer, W.P. (1994) 
Proc. Natl Acad, Set. USA 91, 10747-10751) can be used to generate phage 
display libraries of VSPp genes containing random mutations. Since these 
mutations could alter the structure of VSP, correctly-folded variants can be 
selected for by their ability to bind a set of monoclonal antibodies recognizing 
different conformational domains of wild-type VSP. Likewise, correctly-folded 
variants can be selected by their abilities to homo-heterodimerize. Correctly- 
folded VSP variants (i.e., those retaining the ability to bind VSP-specific 
conformational antibodies and homo/heterodimerize) can be selected by phage 
display technology or screened using a filter lift assay (see methods). Subsequent 
isolation and sequencing of these variants reveals the tolerated mutations. Amino 
acid substitutions which do not compromise the VSPp structure may be good 
candidates for site-directed methionine substitutions. 

In addition to this "random" approach, a method for the "semi-rational" 
incorporation of methionines into VSP was developed. Although the 3-D structure 
of VSP is uncertain, secondary structure prediction of the protein (see strategy 2 
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below) allowed "semi-rational" methionine substitutions. Analysis of VSPP 
homology with tomato acid phosphatase, a protein with 45% identity to VSPP, as 
well as other homologs allowed additional methionine substitutions (see strategy 3 
below). Two methods were designed by which to introduce these substitutions. 
The first method involves DNA shuffling in the presence of excess methionine- 
encoding oligos which, by protein secondary structure predictions, are 
complementary to multiple regions of the VSPP gene corresponding to protein 
loops. The second novel method employed overlap PCR of segments of the VSPP 
gene corresponding to protein loops which have been amplified with the 
methionine-encoding oligos. The methods by which these oligos (corresponding 
to, for example, twenty-two different methionine substitutions) are introduced into 
VSPp result in the production of a library of phage-displayed VSP variants; 
theoretically each variant contains zero to twenty-two additional methionines. 
Subsequent phage display and biopanning of these libraries against VSP-specific 
monoclonal antibodies can lead to the identification of residues in VSP which can 
accommodate methionine without significantly altering the structure of the protein. 

A VSPfJ mutant library was made by error prone PCR methodology (see 
below). From this pool of mutants, a filter lift assay (see methods) was performed 
to identify properly-folded mutant VSPP based on the ability to bind to either 
VSPa or a VSP-specific monoclonal antibody. Using VSPa as the antigen in a 
filter lift assay (Fig. 5) 18 out of 50 VSPP variants tested bound VSPa. Sequence 
analysis of 15 of these variants revealed a total of 84 point mutations which 
correlate with 58 AA substitutions and 25 silent mutations. Together these 
represent 51 different residues within the 218 AA VSPp. 

2 . Secondary structure prediction 

Structural features of a protein are very important for proper folding. 
Sequence analysis tools such as the GCG (Wisconsin Sequence Analysis Package, 
Genetic Computer Group, University Research Park, 575 Science Drive, Madison, 
WI) and PC/GENE (Oxford Molecular Group, 2105 S. Bascom Avenue, Suite 200, 
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Campbell, CA) were used to analyze the VSPp sequence for secondary structure 
features such as helices, sheets and turns and for determining whether a particular 
stretch of amino acids might reside on the surface of the protein. Residues on the 
surface of a protein would likely tolerate substitution more readily than a buried 
residue without compromising the structure of the protein. Using these algorithms, 
numerous predicted turns and surface regions of the protein were identified. Many 
of these regions are expected to tolerate methionine substitution. For example 
residues at positions 25, 30, 32, 37, 44, 65, 67, 102, 121, 130, 160, 163, 164, 169, 
1 98, 202, and 207 in VSPp occur in predicted turn regions and were substituted 
with Met (Table 1). 

3 . Homology comparison 

Over time, nature has tested the tolerance of protein residues to 
substitution, and this is exemplified in the sequences of proteins such as globins 
and cytochromes from several different species, members of which have the same 
fold (Hampsey, M.D., Das, G., Sherman, F. (1988) FEES Lett 231,275; Bashford, 
D., Chothia, C. & Lesk, A.M. (1987) J. Mol Biol. 196, 199; Lesk, A.M. & 
Chothia, C. (1980)7. Mol Biol 136,225). These and other studies have 
demonstrated that hydrophobic residues (such as Ala, Sys, Val, He, Leu, Met, Phe 
and Trp) almost always occur in the hydrophobic core of the protein and that they 
may substitute for each other without undue perturbation of the structure (Bowie, 
J.U., Reidhaar-Olson, J.F., Lim. W.A., & Sauer, R.T. (1990) Science 247, 1306- 
1310; Baldwin, E.P., & Matthews, B.W. (1994) Curr. Opin. Biotech 5, 396-402). 
Indeed, it has been observed that "Residue positions that can accept a number of 
different side chains, including charged and highly polar residues, are almost 
certain to be on the protein surface. Bowie et al (1990) Science 247:1306-1310, 
have Residue positions that remain hydrophobic, whether variable or not, are likely 
to be buried within the structure". Furthermore, in a recent comprehensive analysis 
of substitution patterns in several databases of multiply aligned protein sequences, 
Ladunga and Smith (1997) ProL Eng. 10:187-196, have concluded that the overall 
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emphasis is on the preservation of three dimensional structure of the protein and 
that residues that substitute for each other in related sequences do so by conserving 
the physico-chemical properties of the residue and the folding of the protein. In the 
case of VSP, this evolutionary data was utilized by comparing the homology of 
5 VSPp with six homologous proteins (Fig. 1). Amino acids that are critical to the 

function and/or folding of a protein would be expected to be conserved over time. 
For example, cysteine 7 and 29 are conserved in all seven of the homologous 
proteins aligned in Fig. 1 . These residues are involved in forming a disulfide bond 
that may be expected to be of importance to the structure of the protein. 
1 0 In summary, analysis of the VSPP sequence with its homologs led to the 

identification of 3 1 residues (out of 218 amino acids) that in all liklihood will 
tolerate methionine substitution. 



Engineering VSP(3 for increased methionine 

15 

Rational 

Wild-type VSPp contains 1 .4% methionine. Using the three strategies 
described, three different VSPp variants with increasing amounts of methionine 
have been proposed (9.6%, 14.2%, 17.9%, Fig. 2). The overall amino acid 

20 composition in each of these constructs is presented in Table 2. Construct VSPp- 

met20 (14.2% Met) contains the same 18 Met substitutions as the VSPp-metlO 
derivative plus an additional 1 1 Met residues. Likewise VSPp-met30 contains the 
same 29 Met substitutions as VSPp-met20 plus an additional 7 Met residues. 
Mutational analysis of VSPp resulted in the mutation of 5 1 different amino acids 

25 out of the 2 1 8 amino acid protein. Although these mutations were not methionine 
substitutions, the types of tolerated substitutions were examined for their relevance 
to substitution to a hydrophobic amino acid. For example, positions 50, 67, 93, 
127, 150, and 164 tolerated mutation to a hydrophobic amino acid (Table 1). 
Therefore, it is possible that this same position might tolerate substitution to 

30 methionine. Positions 62, 67, 76, 127, and 164 are hydrophobic amino acids in 
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VSPp - wild type. The observation that these positions tolerate substitution at all 
suggests they would more readily tolerate a conservative substitution (/.e. t 
hydrophobic amino acid to hydrophobic amino acid, Table I). Since residues 32, 
50, 65, 67, 76, 93, 127, 150, 160, and 202 allowed non-conservative mutations, it is 
5 possible that these positions would tolerate mutation to methionine (Table 1). In 
every case where these amino acids were not changed from or to a hydrophobic 
amino acid in the mutational analysis, at least one additional strategy (i.e., 
secondary structure or homology comparison) was used to rationalize methionine 
substitution at the particular position. In summary, in the three methionine 
10 enriched constructs proposed, 12 residues (out of a total of 36) were selected based 
at least in part on mutational analysis. More specifically, mutational analysis 
indicated 6/1 8 methionine substitutions in construct VSPP-metlO, 9/29 in construct 
VSPP-met20, and 12/36 in VSPp-met30 (Table 1). As mentioned, mutational 
analysis revealed 51 different positions within VSPp tolerant to substitutions, 
15 Interestingly, 25/51 (49%) of the mutated positions are located in regions of the 
protein predicted to exist as turns, 1 7/5 1 (33%) in helices, and 9/5 1 (18%) in p- 
sheets. These percentages are significantly different from the predicted distribution 
of turns (25%), helices (25%) and P-sheets (50%), indicating that, as expected, the 
regions of the protein most likely to be located on the surface (e.g., turns) can more 
20 readily accommodate substitutions without compromising the structure of the 

protein. This suggests the importance of protein secondary structure prediction as 
one of the strategies utilized in the identification of residues for methionine 
substitution. 

Since protein turns are generally more surface-exposed regions that do not 
25 contribute greatly to the overall structure of the protein, these regions were targeted 
for methionine substitution. In fact, out of the 36 positions selected for methionine 
substitution, 1 7 (47.2%) are predicted to occur in turns. In contrast, because P- 
sheets are protein structural elements that generally occur at the core of the protein, 
these regions were avoided in selecting sites for methionine substitution. Out of 
30 the 36 positions selected for methionine substitution, only 7 (19.4%) are predicted 
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to occur in P-sheets. Nearly all of these residues were hydrophobic in wild-type 
VSPP and were thought to tolerate methionine based upon the homology 
comparison strategy. Additionally, 12 (33.3%) of the residues selected for 
methionine substitution in the three constructs are predicted to occur in helices. In 
summary, secondary structure prediction is the strategy responsible, at least in part, 
for 17/36 sites targeted for methionine substitution. More specifically, secondary 
structure prediction correlates with the selection of 7/1 8, 14/29, and 17/36 amino 
acids for methionine substitution in constructs VSPp-metlO, VSPp-met20, and 
VSPp-met30, respectively (Table 1). 

Homology comparison was a very informative strategy in selecting residues 
that might tolerate methionine substitution. Accordingly, methionine substitutions 
in VSPP were made by adhering to the following rules and also summarized in 
Table 1: 

(a) Conserved residues (highlighted in blue in Fig. 1) were defined as 
those residues occurring in more than 5 of the 7 homologs. These were not 
targeted for substitution. The exceptions were: at residue numbers 19, 37, 146 and 
179 (one of the homologs contained a methionine residue); at positions 67, 80, 130 
and 169 (conserved hydrophobic amino acid exchanges observed in at least one 
sequence) and at position 50 (non-conservative changes from Asn to Ser/Cys in 
two sequences). 

(b) Similarly, non-conserved positions were defined as those containing 
residues with different side-chain properties. Several positions in VSPp were 
correlated with non-conservative amino acids in the homplogs (e.g., 5, 19, 25, 30, 
37, 44, 60, 62, 65, 67, 72, 76, 80, 90, 97, 102, 121, 127, 130, 135, 142, 146, 150, 
164, 169, 179, 189, 198, 202, 207, and 217). Such residues likely reside on the 
surface/turns of the protein and were considered less important for protein function 
and/or folding and therefore targeted for substitution with methionine. 

(c) In addition, some positions in which at least one other hydrophobic 
amino acid was observed among homologs (e.g., 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 
13, 14, 15, 16, 17, 18, 19, 20, 21, 25, 30, 37, 44, 60, 62, 65, 67, 72, 76, 90, and 97) 
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were also expected to tolerate substitution to the hydrophobic amino acid 
methionine. Exceptions to this were cases in which the hydrophobic amino acid 
was completely conserved in all 6 homologs (e.g., Val 49, Leu 77, Leu 1 10, Leu 
1 14, Leu 145, He 157, Leu 158, He 186, Val 187, Leu 197 and Leu 210). In these 
5 cases, the possibility that the specific hydrophobic amino acid in the wild-type 

protein may be playing a role critical for the proper structure and/or function of the 
protein was considered. To avoid disturbing this possible role, the substitution of 
any residue that is completely conserved in all 6 homologs examined was not 
proposed. 

10 (d) Six residues within VSPp that were expected to tolerate methionine 

substitution were identified based on the presence of methionine in analogous 
positions in homologs (e.g., 19, 37, 44, 146, 179, and 202). 

A few additional considerations were observed in selecting amino acids that 
might tolerate methionine substitution. 

1 5 (e) We avoided altering histidine residues due to their potential 

importance in phosphatase activity of VSPP (Table 2 and DeWald, D.B., Mason, 
H.S., & Mullet, J.E. (1992) J. Biol Chem. 267, 15958-15964). 

(f) Since VSPp is a glycoprotein, this feature may be important for the 
stability and/or function of the protein, substitution of potential glycosylation sites 

20 was avoided (e.g., Asn 94). 

(g) In addition, wherever possible, charged residues such as Lys, Arg, 
Glx, Asx were left untouched to preserve the hydrophobic/hydrophilic balance of 
the protein (Table 2 and Fig. 3A-D). While wild-type VSPp has a calculated 
charge of -4, VSPp-metlO, VSPP-met20, and VSPp-met30 have calculated charges 

25 of -7,-7, and -5, respectively. 

As a strategy, homology comparison facilitated, at least in part, the 
selection of 31/36 of the residues proposed for methionine substitution. These 
selections correlate with 18/1 8, 28/29, and 31/36 residues for constructs VSPP- 
metlO, VSPp-met20, and VSPp-met30, respectively (Table 1). 
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Several of the amino acids selected for methionine substitution in the three 
constructs resulted from more than one strategy. In fact, the majority (20/36) of 
the targeted residues resulted from at least two strategies, with a few (4/36) 
resulting from all three strategies. 

Experimental results 

A synthetic gene for methionine enriched VSPp-metlO has been 
constructed. This synthetic gene differs from wild-type VSPp in that it encodes 
eighteen additional methionines (Fig. 4). Also, a few silent point mutations were 
introduced into this construct to create unique restriction sites. To test whether the 
proposed VSPp-metlO gene was correctly folded, the construct was cloned into the 
phagemid vector pCANTAB-5E and the abilities of the expressed proteins to bind 
VSP-specific conformational monoclonal antibodies in a filter lift assay were 
compared. The results indicate that the VSPP-metl 0 gene was able to bind the 
same antibodies as wild-type VSPp. This suggests that VSPp-metlO may be 
correctly folded in an E. coli secretion system. 

Together, these interdisciplinary approaches should not only result in the 
engineering of a nutritionally-enhanced VSP, but also provide clues to the structure 
of VSP - a protein for which no 3D structure is available. This approach is 
applicable to any protein of interest. 

Methods 

1. Random mutation of vegetative storage protein (VSPp) by error-prone 
PCR 

The VSPp gene was amplified by mutagenic PCR using primers flanking 
the gene. 
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Reaction 1 


Reaction 2 


Reaction 3 


Reaction 4 


lOmM Tris-HCl 


lOmM Tris-HCl 


lOmM Tris-HCl 


lOmM Tris-HCl 


50mM KC1 


50mM KC1 


50mM KC1 


50mMKCl 


9.5mM MgCl 2 


9.mM mgClj 


9.mM mgCl 2 


9.mM mgCl 2 


0.5mM MnCI 2 


0.5mM MnClj 


0.5mM MnCl 2 


0.5mM MnCl 2 


5 ug/ml BSA 


5 ug/ml BSA 


5 jig/ml BSA 


5 ug/ml BSA 


600pmol VSP 
template 


600pmol VSP 
template 


600pmol VSP 
template 


600pmol VSP 
template 


0.1 (am each 
primer 


0.1 umeach 
primer 


0.1 um each 
primer 


0.1 jim each 
primer 


2mM dATP 


200uM dATP 


200uM dATP 


200uM dATP 


200uM dCTP 


2mM dCTP 


200uM dCTP 


200uM dCTP 


200nM dGTP 


200nMdGTP 


2mM dGTP 


200uMdGTP 


200}iM dTTP 


200nMdTTP 


200uM dTTP 


2mM dTTP 


2 Units Taq Pol 


2 Units Taq Pol 


2 Units Taq Pol 


2 Units Taq Pol 



1 cycle (1 min. at 95EC, 1 min. at 51EC, 3 min. at 72EC) 
16 cycles (1 min. at 91 EC, 1 min. at 51 EC, 3 min. at 72EC) 
5 1 cycle (1 min. a91EC, lmin.at51EC, 5min.at72EC) 

The products of these four reactions were pooled, and the band 
corresponding to the mutagenized VSPp gene was purified from an agarose gel, 
digested with S//I and Noil and cloned into the phagemid vector pCANTAB-5E. 

10 

2. Filter lift assay 

Fifty E. coli colonies containing randomly mutated VSPP genes were 
picked as small patches to an SB agar plate containing glucose and ampicillin. 
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Patches were allowed to grow overnight at 37EC and were then transferred to a 
nitrocellulose filter. On the surface of an SB agar plate containing ampicillin and 
IPTG, this filter was placed on top (cell-side up) of a separate blocked filter to 
which the antigen (e.g., VSPa) had been coated. During an overnight incubation at 
5 30EC, the cells expressed the VSPp variant they encoded. These proteins were 
able to diffuse through the top filter and, if correctly folded, bind the antigen- 
coated filter below. The next day, the antigen-coated filter was washed with PBS- 
0.05% tween and incubated with HRP/anti-e tag conjugate. Since the VSPp 
mutants are cloned into the pCANTAB-5E vector which fuses a C-terminal epitope 
1 0 tag (e-tag) to the VSPp protein variants, bound proteins were detected by this 
antibody in combination with enhanced chemiluminescence detection. 
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Table 1 

Proposed methionine Substitution 

Homology comparison 



VSPB 
position 

Construct 1 (9.6% Met) 



Mutational 
Analysis ' 



Secondary 
Structure 2 



Original A.A. 
hydrophobic? 



Met in 
homolog? 3 



# homologs 
hydrophobic 4 



5 


V 




: 


Y 




3 of 6 


19 


I 


_ 




Y 


Y-Ar. VSP 


6 of 6 


30 


V 




T 


Y 




2 of 6 


37 


1 




T 


Y 


Y-T, phos 


6 of 6 


44 


1 


- 


T 


Y 


Y-T, phos 


lof6 


60 


R 


- 


- 


- 




5 of 6 


62 


V 


V-A 


- 


Y 


_ 


6 of 6 


67 


I 


I-T,L 


T 


Y 


_ 


5 of 6 


72 


I 


- 


- 


Y 


_ 


6 of 6 


76 


V 


V-G 


- 


Y 


_ 


5 of 6 


121 


L 


- 


T 


Y 




6 of 6 


127 


I 


I-T.L 


- 


Y 


_ 


3 of 6 


146 


K 




- 


. 


Y-T, phos 
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1 Amino acid substitution observed in the mutational analysis. For example, at position 62,a 
valine to alanine substitution was observed 
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2 "V indicates turn predicted by secondary structure analysis of VSPB. 

3 "Y" indicates the presence of Methionine in the designated VSP homolog. 

4 Includes only aliphatic hydrophobic amino acids such as Leu, He, Val, and Met. 

Table 2 



Amino Acid Composition of VSPB-WT and Methionine-Enriched Variants 
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All publications and patent applications mentioned in the specification are 
indicative of the level of those skilled in the art to which this invention pertains. 
All publications and patent applications are herein incorporated by reference to the 

-25- 



SUBST1TUTE SHEET (RULE 26) 



WO 99/29882 PCT/US98/26209 

same extent as if each individual publication or patent application was specifically 
and individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by way 
of illustration and example for purposes of clarity of understanding, it will be 
5 obvious that certain changes and modifications may be practiced within the scope 
of the appended claims. 
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CLAIMS : 

1 . A method for altering amino acid composition of a native protein of 
interest whose conformation is unavailable, said method comprising introducing 
amino acid changes into said protein to create an engineered protein, said 
engineered protein having the conformation of the native protein wherein said 
conformation of the engineered protein is confirmed by binding with an interacting 
molecule which binds with the native protein. 

2. The method of Claim 1, wherein said interacting molecule is an 
antibody. 

3. The method of Claim 2, wherein said antibody is a monoclonal 
antibody. 

4. The method of Claim 1, wherein said amino acid changes are made 
to increase levels of essential amino acids in the engineered protein. 

5. The method of Claim 4, wherein said essential amino acid is 
selected from the group consisting of methionine, tryptophan, lysine, valine, 
phenylalanine, isoleucine, leucine, theronine and cysteine. 

6. The method of Claim 5, wherein said essential amino acid is 
methionine. 

7. The method of Claim 1, wherein said amino acid changes are 
introduced into predetermined sites. 

8. The method of Claim 7, wherein said predetermined site is 
determined by secondary structure prediction or homology comparison. 

9. The method of Claim 1, wherein said amino acid changes are 
introduced at random. 

1 0. The method of Claim 9, wherein said amino acid changes are 
produced by mutagenic PCR, DNA shuffling, or phage display methodology. 

1 1 . The method of Claim 10, wherein correctly folded variants are 
confirmed by filter lift assay or ELISA. 
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12. The method of Claim 4, wherein said essential amino acids are 
increased to represent 5% of the total amino acid content of the protein. 

13. The method of Claim 4, wherein said essential amino acids are 
increased to represent 10% of the total amino acid content of the protein. 

14. The method of Claim 1, wherein said protein is vegetative storage 

protein. 

15. A method for altering amino acid composition of a native protein of 
interest, said method comprising introducing amino acid changes into said protein 
to increase nutritional value to create an engineered protein, said engineered 
protein to create an engineered protein, said engineered protein having the 
conformation of the native protein wherein said conformation of the engineered 
protein is confirmed by binding with an interacting molecule which binds with the 
native protein. 

16. The method of Claim 15, wherein said interacting molecule is an 
antibody. 

1 7. The method of Claim 1 6, wherein said antibody is a monoclonal 
antibody. 

1 8. The method of Claim 1 5, wherein said amino acid changes are made 
to increase levels of essential amino acids in the engineered protein. 

1 9. The method of Claim 1 8, wherein said essential amino acid is 
selected from the group consisting of methionine, tryptophan, lysine, valine, 
phenylalanine, isoleucine, leucine, theronine and cysteine. 

20. The method of Claim 19, wherein said essential amino acid is 
methionine. 

2 1 . The method of Claim 1 5, wherein said amino acid changes are 
introduced into predetermined sites. 

22. The method of Claim 2 1 , wherein said predetermined site is 
determined by secondary structure prediction or homology comparison. 
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23. The method of Claim 15, wherein said amino acid changes are 
introduced at random. 

24. The method of Claim 23, wherein said amino acid changes are 
produced by mutagenic PCR, DNA shuffling, or phage display methodology. 

25. The method of Claim 24, wherein correctly folded variants are 
confirmed by filter lift assay or ELISA. 

26. The method of Claim 18, wherein said essential amino acids are 
increased to represent 5% of the total amino acid content of the protein. 

27. The method of Claim 18, wherein said essential amino acids are 
increased to represent 10% of the total amino acid content of the protein. 

28. The method of Claim 15, wherein said protein is vegetative storage 

protein. 

29. An engineered protein having altered amino acid composition, 
wherein said amino acid composition has been altered by introducing amino acid 
changes into said protein, wherein said engineered protein binds to an interacting 
molecule capable of binding with a corresponding native protein. 

30. The protein of Claim 29 wherein said interacting molecule is an 
antibody. 

3 1 . The protein of Claim 30, wherein said antibody is a monoclonal 
antibody. 

32. The protein of Claim 29, wherein said amino acid changes increase 
the levels of essential amino acids in the protein. 

33. The protein of Claim 32, wherein said essential amino acid is 
selected from the group consisting of methionine, tryptophan, lysine, valine, 
phenylalanine, isoleucine, leucine, theronine and cysteine. 

34. The protein of Claim 33, wherein said essential amino acid are 
increased to represent 5% of the total amino acid content of the protein. 

35. The protein of Claim 32, wherein said essential amino acid are 
increased to represent 10% of the total amino acid content of the protein. 
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36. The protein of Claim 35, wherein said essential amino acid is 
methionine. 

37. The protein of Claim 29, wherein said amino acid changes are 
introduced into predetermined sites. 

38. The protein of Claim 37, wherein said predetermined site is 
determined by secondary structure prediction or homology comparison. 

39. The protein of Claim 29, wherein said amino acid changes are 
introduced at random. 

40. The protein of Claim 39, wherein said amino acid changes are 
produced by mutagenic PCR, DNA shuffling, or phage display methodology. 

41 . The protein of Claim 29, wherein said protein is vegetative storage 

protein. 

42. A nucleotide sequence encoding the protein of Claim 29. 

43. A nucleotide sequence encoding the protein of Claim 32. 

44. A transformed plant containing within its genome the nucleotide 
sequence of Claim 42. 

45. A transformed plant containing within its genome the nucleotide 
sequence of Claim 43. 

46. A stably transformed plant having inserted into its genome a 
chimeric gene said gene encoding an engineered protein having altered amino acid 
composition wherein said protein, wherein said engineered protein binds to an 
interacting molecule which binds with a corresponding native protein. 

47. The plant of Claim 46, wherein said amino acid changes increase 
the levels of essential amino acids in the protein. 

48. The plant of Claim 47, wherein said essential amino acid is selected 
from the group consisting of methionine, tryptophan, lysine, valine, phenylalanine, 
isoleucine, leucine, theronine and cysteine. 

49. The plant of Claim 48, wherein said essential amino acid are 
increased to represent 5% of the total amino acid content of the protein. 
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50. The plant of Claim 48, wherein said essential amino acid are 
increased to represent 10% of the total amino acid content of the protein. 

5 1 . The plant of Claim 46, wherein said plant is a dicot. 

52. The plant of Claim 46, wherein said plant is a monocot. 

53. The plant of Claim 52, wherein said monocot is maize. 

54. The plant of Claim 52, wherein said dicot is soybean. 

55. Seed of the plant of Claim 46. 

56. Seed of the plant of Claim 52. 

57. Seed of the plant of Claim 53. 
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HYDROPATHY INDEX COMPUTATION FOR SEQUENCE VSPM30. 
TOTAL NUMBER OF AMINO ACIDS IS: 218. 
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HYDROPATHIC INDEX OF VSPM30 FROM AMINO ACID 1 TO AMINO ACID 218. 
COMPUTED USING AN INTERVAL OF 9 AMINO ACIDS. (GRAVY-5.31 ). 
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Figure 4 
VSPP-metlO sequence 

Sfil 

1 GGCCCAGCCGGCCAGATCTTCGGAGATGAAATGCGCTAGCTTTAGGCTTGCTGTGGAAGC 60 

CCGGGTCGGCCGGTCTAG AAGCCTCTACTTTACG CGATCGAAATC CGAACGACACCTTCG 

61 ACACAACATGCGAGCCTTTAAAACCATTCCTGAAGAGTGCATGGAACCAACAAAGGACTA 120 
TGTGTTGTACGCTCGGAAATTTTGGTAAGGACTTCTCACGTACCTTGGTTGTTTCCTGAT 

121 CATGAATGGCGAACAATTTCGAATGGACTCTAAAACAGTTAACCAACAGGCCTTCTTTTA 180 
GTACTTACCGCTTGTTAAAGCTTACCTGAGATTTTGTCAATTGGTTGTCCGGAAGAAAAT 

181 TGCTAGTGAAATGGAAATGCATCACAACGACATGTTTATATTCGGCATGGATAACACCAT 240 
ACGATC ACTTTACCTTTACGTAGTGTTG CTGTACAAATATAAG C CGTAC CTATTGTGGTA 

241 GCTCTCTAATATCCCATACTATGAAAAACATGGATATGGGGTGGAGGAATTTAATGAAAC 300 
CGAGAGATTATAGGGTATGATACTTTTTGTACCTATACCCCACCTCCTTAAATTACTTTG 

301 CTTATATGATGAATGGGTTAACAAGGGCGACGCACCGGCATTGCCAGAGACTCTTAAAAA 360 
GAATATACTACTTACCCAATTGTTCCCGCTGCGTGGCCGTAACGGTCTCTGAGAATTTTT 

361 TTACAACAAGCTGATGTCCCTTGGCTTCAAGATGGTATTCTTGTCAGGAAGGTACCTTGA 420 
AATGTTGTTCGACTAC^GGGAACCGP-AGTTCTA.CCA.TAAGA.ACAGTCCTTCCATGGAACT 

421 CAAAATGGCCGTAACAGAAGCAAACCTAATGAAGGCTGGCTTCCACACATGGGAGCAGTT 450 
GTTTTACCGGCATTGTCTTCGTTTGGATTACTTCCGACCGAAGGTGTGTACCCTCGTCAA 

481 AATTCTCAAGGATCCACATCTTATGACTCCAAATGCACTTTCATACAAATCAGCAATGAG 540 
TrAAGAGTTCCTAGGTGTAGAATACTGAGGTTTACGTGAAAGTATGTTTAGTCGTTACTC 

541 AG AG AAT ATG TTG AGG C AG GG AT AC AG AATTGTTGGAATG AT TGGTGATCAATGG AG CG A 600 
TCTCTTATACAACTCCGTCCCTATGTCTTAACAACCTTACTAACCACTAGTTACCTCGCT 

601 TCTGCTTGGAGACCACATGGGCGAATCTAGAACCTTTAAGCTTCCTAATCCCATGTACTA 660 
AGACGAACCTCTGGTGTACCCGCTTAGATCTTGGAAATTCGAAGGATTAGGGTACATGAT 

661 CATGGAGGCGGCCGC 675 
GTACCTCCGCCGGCG 
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COLONY LIFT ASSAY TO DETECT PROTEIN-PROTEIN INTERACTIONS 




COLONIES ON MASTER FILTER 
VSPa-COATED FILTER 
SB+AMP+IPTG 



LAYER ANTIGEN (VSPa)-COATED FILTER AND 
COLONY LIFT FILTER ON SB-IPTG-PLATE 




MASTER FILTER OF COLONIES 

EXPRE<5«? V«5PA MUTANT*? AT qn°r CONTAINING VSPfl MUTANTS 

EXPRESS VSPJ3 MUTANTS AT 30 C CLONED INTO PHAGEMID VECTOR 



CORRECTLY-FOLDED VSP0 VARIANTS DIFFUSE THROUGH 
THE MASTER FILTER AND BIND TO THE VSPa-COATED FILTER 

WASH FILTER 



VSPa-COATED FILTER IS INCUBATED 
WITH HRP/ANTI-e TAG CONJUGATE 




DEVELOP FILTER WITH SUBSTRATE (ECL) 



DEVELOPED VSPa- 
COATED FILTER 



FIG. 5. 
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